aggressive augmentation
RSA: Reducing Semantic Shift from Aggressive Augmentations for Self-supervised Learning
Most recent self-supervised learning methods learn visual representation by contrasting different augmented views of images. Compared with supervised learning, more aggressive augmentations have been introduced to further improve the diversity of training pairs. However, aggressive augmentations may distort images' structures leading to a severe semantic shift problem that augmented views of the same image may not share the same semantics, thus degrading the transfer performance. To address this problem, we propose a new SSL paradigm, which counteracts the impact of semantic shift by balancing the role of weak and aggressively augmented pairs. Specifically, semantically inconsistent pairs are of minority, and we treat them as noisy pairs.
22f791da07b0d8a2504c2537c560001c-AuthorFeedback.pdf
We thank the reviewers for their feedback. We also acknowledge that more experiments in 4.2 (R3) The drop in performance could be attributed to domain gap between COCO-full and P ASCAL-cropped. Y et, COCO-Boxes model outperforms COCO-full. To support it even further, we do another experiment. MOCO should not be as robust as supervised models on part-classifcation task.
RSA: Reducing Semantic Shift from Aggressive Augmentations for Self-supervised Learning
Most recent self-supervised learning methods learn visual representation by contrasting different augmented views of images. Compared with supervised learning, more aggressive augmentations have been introduced to further improve the diversity of training pairs. However, aggressive augmentations may distort images' structures leading to a severe semantic shift problem that augmented views of the same image may not share the same semantics, thus degrading the transfer performance. To address this problem, we propose a new SSL paradigm, which counteracts the impact of semantic shift by balancing the role of weak and aggressively augmented pairs. Specifically, semantically inconsistent pairs are of minority, and we treat them as noisy pairs.